CVS $Revision: 1.19 $Date: 2008/08/25 15:39:40 



Comprehensive Two-Point Analyses of Weak Gravitational Lensing Surveys 

Gary M. Bernstein 

Dept. of Physics and Astronomy, University of Pennsylvania, Philadelphia, PA 19104 

garybOphysics . upenn . edu 

ABSTRACT 

We present a framework for analyzing weak gravitational lensing survey data, in- 
cluding lensing and source-density observables, plus spectroscopic redshift calibration 
data. All two-point observables are predicted in terms of parameters of a perturbed 
Robertson- Walker metric, making the framework independent of the models for grav- 
ity, dark energy, or galaxy properties. For Gaussian fluctuations the 2-point model 
determines the survey likelihood function and allows Fisher-matrix forecasting. The 
framework includes nuisance terms for the major systematic errors: shear measurement 
errors, magnification bias and redshift calibration errors, intrinsic galaxy alignments, 
and inaccurate theoretical predictions. We propose flexible parameterizations of the 
many nuisance parameters related to galaxy bias and intrinsic alignment. For the first 
time we can integrate many different observables and systematic errors into a single 
analysis. As a first application of this framework, we demonstrate that: uncertainties 
in power-spectrum theory cause very minor degradation to cosmological information 
content; nearly all useful information (excepting baryon oscillations) is extracted with 
~ 3 bins per decade of angular scale; and the rate at which galaxy bias varies with 
redshift substantially influences the strength of cosmological inference. The framework 
will permit careful study of the interplay between numerous observables, systematic 
errors, and spectroscopic calibration data for large weak-lensing surveys. 

Subject headings: gravitational lensing; cosmological parameters; relativity 



1. Introduction 



Weak gravitational lensing of background sources can produce exceptionally strong constraints 
on cosmological parameters and tests of General Relativity. Initial analyses considered the 2-point 
correlation function (or, equivalent ly, power spectrum) of the shear pattern induced on a si ngle 



population of background galaxies (|Miralda-Escudelll99ll : lKaiserlll992l ; iBlandford et al.lll99ll ). A 



wealth of new statistics, however, have been suggested as more powerful means to extract infor- 
mation from weak lensing (WL): cross-power spectra of multiple source populations with distinct 
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redshift distribu tions (a.k.a. "tomo graphy") (|Hulll999l ); the correlation of shear with foreground 
galaxy clusters ()Jain k Tavlorll2003l). or more generally the cross-correlation of lensing shear with 
the galaxy distribution (jBernstein k Jainl 12004 ; IZhang. Hui. k Stebbinsl l2003l ) ; joint analyses o f 
density-density, density-shear, and shear-shear corre lations in a n imaging survey dHu fc Jaujl2004h: 
cross - correlation of magnifi cation as well as shear ( Jain 2002 ); use of the CMB (Ihu k Qkamotc " 



2002 



Zahn k Zaldarriagall200fil ) as source planes; c ross-correlation of source density or s hear with a dis- 



Hirata k Seliakll2003l ) or recombination-era 21 cm sig nals dMetcalf k WhitdboOTl : Pen 



tinct spectroscopi c galaxy survey popu lation (INewman 



2008 



Schneider et al 



2006 ) ; and the use of 



3-point statistics ( Takada k Jainl 2004 ) or statistics such as peak counts ( Hennawi k Spergel 2005 ; 
Wang. Khourv k Haimanl [2004J; iMarian k Bernstein! 120061 ) to move beyond 2-point information. 
Each of these potential innovations has been individually analyzed and shown to improve cosmo- 
logical constraints. The first goal of this paper is to consider the simultaneous use of all of these 
observable statistics: can we forecast the cosmological information that they will yield collectively 
in future surveys? Can we start to develop a framework in which all these signals could be analyzed 
simultaneously in a real experiment? 

In parallel with the increasing variety of proposed WL signals, the community has identi- 
fied a series of potential astrophysical and instrumental non-idealities in WL data which, if ig- 
nored, would lead to substantial systematic errors in the inferred cosmology. These include: fi- 
nite accuracy in ou r ability to predict the deflecting mass power spec trum due to nonlinearities 
(jJain k Seliak!ll997l ) and baryonic physics dZhanll2006l ; iJing et aZ.ll2006l ) : intrinsic alignments (IA) 
between gal axy shapes (Croft k Metzler 200ol ) and between galaxy shapes and the local mass 
distribution ( Hirata k Seljak 20041 ) that are not induced by lensi ng; multiplicative "shear calibra- 



tion" errors in the derivation of lensing shear from galaxy images (jlshak et al.ll2004l : iHuterer et al 



2006 ); additive "spurious shear" due to uncor rected PSF ellipticity or other imaging systematics 



source populations (|Ma. Hu. k Huterei 



(jHuterer et al.l 120061: lAmara k Refreeier 120071 ) ; and errors in the assignment of redshifts to the 



200 



(]). The impact of these systematic-error sources on 



cosmological inferences have been analyzed by different means, but a second goal of this paper is 
to produce a comprehensive forecast that considers the presence of them all simultaneously. 

Previous work has shown that these multiple sources of information and systematic in WL 
surveys can interact in interesting ways. For example, in the presence of tomographic data, many 
systematics are readily distinguishable from cosmological signals and can hence be diagno sed and 
corrected internally to a survey; this approach is called self- calibration (jHuterer et al.ll2006l ). It has 
also been shown that co mbining galaxy density a nd lensing correlations can lead to self-ca libration of 
shear calibration errors (jBernstein k Jainll2004l ) and the uncertainties in galaxy biasing ([Hu k Jain 
20041 ; IZhanl l2006). Intri nsic alignments of galax ies can be diagnosed and corrected if tomographic 
information is available (jKing k Schneideribood ) . however this places substantially g reater demands 
on th e precision and accuracy of redshift assignment than would otherwise be needed ([Bridle k King 
20071 ). These investigations raise important practical questions: will the self-calibration techniques 
continue to succeed when we attempt to simultaneously self-calibrate several different systematic 
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errors? Do cross-correlation techniques reduce uncertainties in redshift distributions to negligible 
levels, or is it necessary to m ake a complete spectro scopic redshift survey of some size to measure 



redshift distributions directly (|Ma &: Bernsteinll2008l )? This paper will present a formalism through 



which all these questions can be answered, but we defer to later papers the application of the 
framework to these issues. 

A third goal of this work is to describe the constraints by WL in a language that is not tied 
to a specific cosmological model. Most forecasts for WL survey constraints are done within the 
context of a Universe that has homogeneous dark energy with equation of state uu = wq + w a (l — a). 
Projecting the WL experiment onto this model gives concrete predictions, but obscures what the 
WL is really measuring. So the analysis framework presented here will be dark-energy agnostic, 
meaning that no specific model is assumed. We will be very explicit about the assumptions made 
in the analysis and try to keep them to a minimum. In fact a great strength of WL experiments are 
their ability to test General Relativity itself, so we seek an analysis method that is g eneral enough 



to incorporate such tests. Similar to the approach of iKnox. Song, Tysonl (|2006l ). our analysis 
results in constraints on the distance and growth functions D(z) and g^,(z), without reference to 
the particular dark-energy or gravity modifications that might cause deviations from ACDM. 

In the following section we describe a "kitchen-sink" formalism for WL survey observables 
that allows the incorporation of all suggested 2-point statistics and very general treatments of 
nearly all proposed systematic errors. In $3] we give a likelihood function and Fisher matrix for 
an unbiased spectroscopic redshift survey of source galaxies. Then we briefly describe a software 
implementation of the lensing and spectroscopy likelihood calculations. We describe our model for 
the evolution of the lensing-potential power spectrum in £j5l and £j6]we describe generic models used 
for the nuisance functions required in the lensing-survey analysis. In $7]we use the implementation 
of these methods to investigate the proper choices for the bin sizes and grid spacings needed to 
turn the lensing analysis into a tractable finite-dimensional problem. Further application of the 
framework to survey forecasting will be done in future papers. 

An earlier versi on of this WL analy sis formalism was used to generate forecasts for the Dark 



Energy Task Force (jAlbrecht et al.1 12006). and is described in an appendix to that report. 



2. The Weak Lensing 2-Point Likelihood 
2.1. Observables 

We make the assumption that the Universe has only weak scalar perturbations to a homogeneous 
and isotropic 4- dimensional metric. In this case the metric can be written in the Newtonian gauge 
ELS cl perturbed Robertson- Walker metric: 



ds 2 = (1 + 2*)dt 2 - a 2 (t)(l + 2$) [d X 2 + xlS 2 ( X /xo)(d9 2 + sin 2 9d^ 2 )] 



(1) 
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We assign all mass and sources in the Universe to a series of narrow spherical shells centered 
at redshifts a% = (1 + for i G {1, . . . , N z }. There is a comoving angular diameter distance Di 
to each shell, and the comoving radial extent of each shell is Axi- Note that the Robertson- Walker 
metric formula for angular-diameter distance is D = XoSk(x/xo), where xo is the comoving radius 
of curvature of the Universe. For small values of the curvature uj k = — k/xo we have 



A Xi « AD(1 - oj k Df/2) 



D; 



[l-u k D 2 j2). 



(2) 



The Robertson- Walker metric also requires Ax-i = Azi/h(zi) = Aai/a 2 h(ai). 
Hubble parameter will be written as H(z) = h(z)Hioo, -Hioo = lOOkms -1 Mpc" 
will be in units of c/H W0 = 2998 Mpc. 



In this paper the 
1 , and all distances 



We assume that the photon sources in a survey will be divided into a series of sets a € 
|1, 2, . . . , iy . sl- N ote the use of latin indices for redshift shells, greek for source sets. We follow 



Hu &: Jainl (|2004h by assigning each source set up to two observables: first its sky-plane density 
fluctuations g a (0,<p), and second a lensing convergence K a (9,(p). The convergence k might be in- 
ferred from the shear or flexion ([Goldberg Baconl l2005) of galaxies, by a quadratic estimator on 
the CMB or 21-cm radiation fluctuations, or by any other observable except the source density. 
The sou rces can be assig ned to sets by photometric or spectroscopic redshift, or even cruder color 
criteria (|Jain et al.l 120071 ). but there could be other criteria such as galaxy type, or perhaps obser- 
vation by different instruments. We demand only that the criteria for division of the sources be 
spatially homogeneous, and that the division be invariant under application of gravitational lensing 
distortion. For notational convenience we assign each set a nominal redshift z a , but a set can span 
a broad redshift range. If the sources are discrete objects such as galaxies, then the mean density 
on the sky of members of each set are denoted n a . 

A source in set a has a probability p a i of lying on redshift shell i. The collection of galaxies 
in set a on shell i will be called the subset ai. The survey is assumed to tell us only which set 
any individual galaxy belongs to, but not which subset. The p a i are parameters which must be 
constrained by the lensing survey data or by additional observations, e.g. a spectroscopic redshift 
survey. 

When the lensing sources are drawn from a spectroscopic survey (or when the source is the 
CMB), then the redshift probability is known a priori, and in particular the sets are probably 
divided by redshift so that p a i is essentially the identity matrix. The formalism can obviously 
accommodate the simultaneous analysis of WL samples with varying modes of redshift assignment. 

Both the source density fluctuation g a and convergence K a have a component due to intrinsic 
fluctuations plus a component due to gravitational lensing. Both are also measured as weighted 
sums over their respective subsets. We have 



l + 9«{e,<p) = X>™ [1 + ^(0, p)] + 



(3) 
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,<p) + (l + f a 



.lens 



(4) 



Here we have assigned each subset a magnification bias factor q a i and a shear calibration factor f a i- 
In a simple flux-limited selection, the magnification bias factor will be determined by the logarithmic 
slope of the counts vs flux, and is typically of order unity. The shear calibration factor allows for 
the possibility that the inferred lensing convergence is mis-measured by some factor 1 + f n j du e to 
multiplicative errors in the lensing methodology, e.g. as investigated bv lHevmans et al.l (|2006). 

In the limit g mt -C 1 and K lens < 1, we can drop the second-order term in Equation [3] and 
write 



9o 



i 



n int -I- a K lens 
ilea ^ Hai^z 



_int 



+ (1 + /»)« 



lens 

i 



(5) 



(6) 



In this case the equations are linear in all the angular functions g and k, so we can decompose them 
into spherical harmonic coefficient g a im, K ^m' e * c ' an( ^ Equations ([5]) and ([6]) hold independently 
for every harmonic £m. We will henceforth assume that the spherical-harmonic decomposition has 
been executed for all the angular functions g, k, and suppress the tm indices for brevity. 

We note that while «; lens <C 1 is a good approximation over most of the sky, g mt <C 1 is a poor 
approximation for thin density slices on smaller angular scales. We will forge ahead nonetheless 
with the assumption that lensing magnification simply adds to the intrinsic density fluctuations, 
recognizing that a real analysis of data with magnification bias may require inclusion of the nonlinear 
coupling between spherical harmonics that is induced by magnification bias on highly structured 
density fields. 

The lensing convergence is determined entirely by the metric if we make the assumption that 
light rays are following its null geodesies. The paths of null geodesies are determined by the lensing 
potential 

= (7) 
For each of our redshift shells we define a projected lensing potential via 



ba dx, 



Ax 



(8) 



where the derivatives are taken with respect to angles on the sky. We will generally assume that ip, 
like the observables, has been decomposed into spherical harmonics, and we will take the flat-sky 
approximation . 

With the definition 0, and the adoption of the weak-lensing limit and Born approximation, 
the lensing convergence is 



Jens 



^ 3 2a~D 



(9) 
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D 



A; 



TT ~ 11 Dj/Di){l - uokDiDj/2) z t > Zj , 
Zi < Zj. 



(10) 



Dij is the comoving angular diameter distance to Z{ as viewed from Zj. In summary, the observables 
from the survey are, for each spherical harmonic: 



i)j 



l3 2 aj Dj +9m 



(11) 



ti | ... 



mt 

at 



We reiterate that these equations depend only upon the assumption of a Robertson- Walker metric 
with scalar perturbations, plus the approximation that magnification bias and intrinsic density 
fluctuations are additive. 

The equations for the two observables are symmetric under the interchange of g «-► k and 
q <-> (1 + /). Since q ~ 1 + /, the lensing effects are similar. However the intrinsic density 
fluctuations g mt are ~ 300 x stronger than K mt , breaking the symmetry. Density- field observations 
are dominated by the intrinsic signal while convergence (shear) observations are dominated by 
lensing effects. 



2.2. Degeneracies 

Equations (|9l)- (| lip reveal a family of degeneracies present in lensing observations, as described 



m iBernsteinl (|2006l ). The transformations 

Dj -» Dj(l + a + aiDj + a 2 D]) 

ipi -> ^(1 + 00 + 201^ + 202^) (12) 
^fc -»• ^fc + 2o 2 

leave the observables unchanged, to first order in {oo, ot\D, a 2 D 2 , UkD 2 }. It will hence be impossible 
for lensing+density surveys to constrain uj^ or any quadratic (in D) deviations to lnZ?, unless there 
are prior constraints on these variables or on ip, g mt , or K mt . Constraint on these three degeneracies 
is unlikely to arise from models of intrinsic clustering or alignment, since it is unlikely that a priori 
models of the redshift dependence of galaxy bias could reach high precision. We hence expect that 
these degeneracies are going to be broken by theoretical models of the potential fluctuation power 
spectrum, or by other distance indicators such as supernovae or BAO. 
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2.3. Limber Approximation 

To forecast the constraints on the parameters of this model, we require a likelihood expression 
for the observables. The two fundamental assumptions we make are: 

1. The distributions of the lensing potential, intrinsic galaxy density fluctuations, and intrinsic 
shape correlations ipi,g^, and k 1 ^ are described by a multivariate Gaussian with zero mean. 

2. The Limber approximation is valid and there is no correlation between these variables on 
distinct redshift shells or between different spherical harmonics: 

(Xie m Yje> m >) = 5ij5 u >5 mm > (DfAxi) 1 P t XY (£/D i ) 

(Xitmipje'm') = -25ij5i£'5 mm 'ai(£/Di) 2 P x<i> (e/Di) (13) 
(i>itm^jt>m>) = 5 tj 5 u ,5 mm ,Aa 2 l D 2 t & Xl {t/D t fP?' i> (l/D, i ) 

where X, Y € {g mt ,n mt }, and P XY (k) is the 3-d cross-spectrum of variables X and Y at 
epoch cij. 

The first assumption insures that the likelihood of an observation is fully specified by the expected 
covariance matrix of the observables. The second assumption implies that this covariance matrix 
can be expressed in terms of the 3-d cross-power spectra of the lensing potential, subset densities, 
and subset intrinsic correlations {4>i,g^, k 1 ^} at each redshift shell. 

2.4. Biases and Correlations 

A typical convention is to express the galaxy density power P" as a bias-scaled version of the 
mass spectrum P m , plus a Poisson shot-noise contribution, and then describing the mass-galaxy 
covariance P m9 with a correlation coefficient: 

P gg = ( b 9fpm + }_ ( 14 ) 
P 

pmg = h 9 r gpm^ ^ 

The comoving volume number density p of the sources determines the shot noise for a Poisson 
process, but there is no guarantee that the galaxies are distributed in the mass distribution by a 
Poisson process. Even when the galaxies do not have Poissonian shot noise, we can still usually 
write the power in this way for some bias parameter b; we just might keep in mind that r 9 > 1 is 
formally allowed if the sources are not Poisson-distributed. 

Most generally, both the bias and correlation coefficients are different for each source subset 
as well as being functions of comoving wavenumber k. Each set a has a nominal redshift z a , and 
each subset has a redshift deviation Az a i = zi — z a . Galaxies with bad photo-z errors could easily 
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have different bias from those with good photo-z's; for example, highly-biased early types tend to 
have better photo-z's. So our analysis methods should allow for this complication. 

We will adopt the bias/correlation notation for the intrinsic galaxy density fluctuations and for 
the intrinsic convergence K mt , except that we will parameterize the bias and covariance with respect 
to the lensing potential rather than mass distribution. If Pf®a is the 3d cross-power between density 
fluctuations in subsets ad and j3i at wavenumber k, and we write Pf for the lensing-potential 3d 
power spectrum, then we express: 

Pf3 a = b ^ r 99^J^ 1 _) 2 k 4 p f + ^E 

■2 



iaf3 cei f3i a/3i I <^ 



m J Pai 



P KK _ h K IK KK ( 2(1 i \ h ip<t> , 5 <*P a i / 1fi x 

p9* _ h 9 h K ok ( 2a i \ ,4p</> 

where p a i is a comoving volume density of the galaxy subset in the shell and <7 7 is a measure 
of the shear noise per galaxy. These describe the normal "shape noise" term in the shear power 
spectrum and the shot noise in the density field. If flexions or other observables are used to infer 
the convergence, then the shape noise term may have a different form. 

And if J^f is the cross-power between the density of subset ia and lensing potential, we express 

p£ = -Ksl^rt (17) 

p*<t> — W r K 2fl » U2p<j> 
OU! m 

Note that specifying the bias and correlation b K and r K of the intrinsic convergence with the lensing 
p otential is equivalent to giving the "GI" and "II" intrinsic-alignment information, in the notation 



of Iffirata fc Seliakl (|2004 ). 



The lensing power P^ is a function of z and k. The biases and correlation coefficients 
^ai' r ai'^ai' an( ^ r ai are functions of k, the nominal redshift z a of the source set, and Az a i, the 
difference between the subset redshift and the nominal set redshift. 

Most complicated are the cross-correlation coefficients such as which are, most generally, 
functions of k, Zi, and both redshift errors Az a i and Az^j. In order for the covariance matrix of all 
these fields to be symmetric, we require r 9 J ai = r™i = 1 and the symmetry r^Z = r ^ai- Otherwise 
the correlation coefficients are free to vary, subject to the constraint that the overall correlation 
matrix of the potential and all fluctuations must remain non-negative. 

This parameterization of the fluctuations of the potential and the intrinsic fluctuations is 
completely general — we have not introduced any further assumptions into the model as long as all 
the 6's and r's and P^'s are free parameters (non-negative in the last case). 
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2.5. The Two-Point Statistics 



Combining the formula for observables (jlip . the Limber formulae (|13p . and the bias notation 
(fT6|) . (fTTj) . the covariance matrix for the observables at a given multipole can be broken 

into three submatrices: 



C a3 = (daga) 



a3 



*K ) 



^PaiP3j I <7m<?/3j ^ A in A jn AXnKP$(k ri 
ij I n 

+ qai.Aij—i-Dj 1 b 9 /3j r 9 p j kjP't > (kj) 

OLU m 

+ q 3 3 A 3 ^D-^ m r 9 at kfPHh) 

6L0 m 



+ Si 



2a, 
3uj r 



^PaiPfo \ i 1 + X] A in Aj n AXnknP^(k ri 

ij I n 

- (1 + UAj^Dfb^kjpt'ikj) 



2a,; 



+ q p jAji -^D- 1 b« i r K a ikiP*(k) 

OLO m 



+ Si 



2a, 



S ZjV*iP33 \ (1 + /m)(! + S A in ^ n Ax„^P,f (^) 



(1 + /ai)A 



2a; 



2a, 

3tJ r , 



(18) 



(19) 



(20) 



The comoving wavevector is fcj = t/Di. In each equation, note that only one of the last three terms 
is non-zero, depending on whether j < i, j > i, or j = i, respectively. For th e shear-shear correlatio n 
C KK , the i = j term is recognizable as the "II" intrinsic-correlation effect of lHirata Seljakl (|2004l ). 
while the i < j and j > i terms are their "GI" effect. 

Note that the last term in the density-shear expression C Kg is an additional intrinsic-correlation 
term, between the galaxy density and the intrinsic shapes, which is distinct from the covari- 
ance between the lensing potential and shear. This galaxy-shear cor relation has been constraine d 



Faltenbacher et al. 


(2007 


); 


Hirata et al. 


(20041 
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Examining the density-density correlation C" we find the final term has the normal expected 
form, but the first three terms describe correlations induced by lensing magnification. 

Finally we note that the covariance matrix manifests the same symmetries for g <-> k that were 
discussed at the end of §2.11 



2.6. Likelihood and Fisher matrix 

Under our Gaussian assumption, the likelihood functions for the observables are independent 
at each multipole Im. We define a data vector dg m to be the union of the g a and K a observables at 
each multipole, and Ci to be the covariance matrix derived above. Under our Gaussian assumption, 
the total likelihood for the survey is 

-21nL = ^[dLC7 1 d £m + ln|Q|] . (21) 

im 

Forecasts of survey performance are made using the Fisher matrix. The usual formula for zero-mean 



Gaussian distributions applies (jTegmark. Taylor. &: Heavens! 119971 ). We reduce the mode sum to a 



series of Nf bins centered on multipoles then the Fisher matrix element for parameters p and q 



is 



_ ^ (24 + 1) A4 / sky 
i=l 



i% Qp k Qq 



(22) 



Examination of Equations (|18p - (|20p shows that all derivatives of C with respect to parameters are 
very simple. The calculation of the Fisher matrix is reduced to rapid linear algebra, significantly 
accelerated by exploiting the very sparse nature of most of the derivative matrices. 

We have thus succeeded in producing a likelihood function for the most general joint lensing+density 
survey, for the case of Gaussian likelihoods limited to 2-point statistics. Given a likelihood we can 
of course form a Fisher matrix for forecasting, or we can execute a maximum-likelihood analysis 
of real data. Since this likelihood function makes no mention of a particular dark-energy theory, 
we see that the parameterization chosen here permits a highly flexible analysis. Indeed no theory 
of gravity or initial conditions of the Universe have been assumed either, just the existence of a 
Newtonian gauge metric on a RW background cosmology. The lensing-potential power spectrum 
P^(k,z) appears as a series of free parameters, as do the bias and correlation coefficients of the 
galaxy density and intrinsic alignments. 

We have variables that describe, in the most general possible fashion, the important systematic 
errors, excepting additive shear contamination: 

1. Uncertainty in power-spectrum theory will be expressed through prior distributions on the 
Pf parameters. 

2. Shear calibration errors arises through finite prior uncertainty on the / a j. 
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3. Magnification-bias calibration errors arises through finite prior uncertainty on the q a i- 

4. Intrinsic alignments are embodied through the b K , r K , r 9K , and r KK coefficients. 

5. Redshift-distribution errors are manifested through the uncertainties in the p a i probabilities. 

The cost of this great generality is that there are a huge number of nuisance parameters, 
enough to make us doubt whether the maximum-likelihood analysis — or even the Fisher-matrix 
analysis! — is feasible. 



2.7. Parameter Inventory 

The WL survey covariance matrix has a horrendously large number of parameters. The cos- 
mological treasure lies in these: 

• The 2 global cosmological parameters uj m and u>k- 

• The distances Di, which encode the expansion history of the Universe in N z steps. The A\i 
and the Hubble parameters h(zi) can be expressed in terms of these and u>k- 

• The metric-potential power spectra Pf, which describe the growth of dark-matter structure. 
For Ni bins in I, there will be N^N Z distinct matter-power parameters in the model. A 
prediction for the growth of potential fluctuations will typically be an important element of 
any cosmological scenario under test, so the Pf can be replaced as parameters by a much 
smaller number of cosmological parameters. 

There are then a large number of nuisance parameters. If there are N ss non-empty source subsets, 
the nuisance parameters are 

• The redshift-distribution parameters p a i, with N ss — N s degrees of freedom. 

• The shear-calibration errors / a j, another N ss degrees of freedom. 

• The magnification-bias coefficients q a i, another N ss degrees of freedom. 

• The source-density biases b a ai and correlation coefficients r 9 ai with respect to 0, which may be 
scale-dependent, yielding 2N^N SS degrees of freedom. 

• The intrinsic-alignment power and correlations with the mass, and r^, another 2NeN ss 
parameters. 

• The correlation coefficients r^, r^, and r^, which also may be scale-dependent. The 
number of such parameters is ss 3N£Ng S /2N z . 
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The number of nuisance parameters for a non-parametric analysis is enormous. If we are analyzing 
a photo-z survey with typical errors Az « 0.05(1 + z), then we would typically want to space the 
redshift shells logarithmically in 1+z, with A In a ~ 0.02 so that we resolve the redshift distribution 
of each photo-z bin. In this case, N z ~ 100 bins span < z < 5, and we will require N ss > 1000 if 
we track all subsets out to ±3<t of the photo-z distribution. 

To reduce the dimensionality of the likelihood function, we can replace many of the discrete 
nuisance parameters by the values of parameterized functions for the nuisance variables. Table Q] 
lists the variables in the WL likelihood function that can be replaced by parametric functions. The 
nuisance variables are functions of: wavevector h] redshift z\ and redshift difference Az — z a — z% 
between the nominal and true redshifts of a source subset. In later sections we will describe the 
parametric functions that we have implemented to reduce the number of degrees of freedom in the 
model. Each time we introduce a parametric function, we need to choose a fiducial parameter set 
and a prior distribution for the parameters. 



3. Spectroscopic Redshift Likelihood 

If we draw a single member from source set a and measure its spectroscopic redshift in an 
unbiased fashion, then by definition the likelihood of the spectroscopic redshift being on shell i is 
p a i. If we measure N a pec redshifts, and find that N^ ec are on shell i, then the likelihood is 

lnL = Y,N% ec \np ai . (23) 

i 

This is true if the redshifts are statistically independent, which requires that they be dispersed 
across the sky to eliminate source correlations. We assume this limit. 



Following iMa &; Bernstein! (|2008l ). the Fisher matrix for the parameters {p a i} resulting from 



the unbiased spectroscopic observations is 

'3 2 (-lnL)\ _ 



Fapj = { " „ ' ) = ^ pe — • (24) 
\ dp ai dp aj I p Qi 

We add this Fisher information to the density-lensing Fisher matrix (I22h when considering the 
constraints offered by a WL survey that is combined with an unbiased spectroscopic redshift survey 
drawn from one or more of the source population sets. 

We do not in general presume any functional form for the p a i redshift distributions when 
the sets are assigned from photo-z's. We adopt fiducial values either from an analytic form or 
from a simulation of photo-z performance. Then we leave all the p a i as free parameters in the 
Fisher matrix, adding the spectroscopic-survey Fisher matrix (|24p if appropriate to the planned 
experiment. Note that the cross-correlations in the WL survey data offer constraints on the redshift 
distribution even if there is no unbiased spectroscopic survey (N a pec = 0). 
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4. An Implementation 

A package of C++ classes implements the Fisher matrix calculation for Gaussian lensing+density 
observations, the spectroscopic-survey Fisher matrix, plus the models for lensing-potential power 
and nuisance functions described below. From these classes we can construct numerous applications, 
the most obvious being a Fisher-matrix forecast of cosmological constraints from the combination 
of a photometric galaxy lensing/density survey, plus a redshift survey to constrain the photo-z 
distribution. We list in Table [2] input fields for this forecasting implementation. Further program 
inputs are listed in later sections which detail the models for the lensing power spectrum and 
nuisance parameters that we describe below and adopt for this implementation. 

In the current implementation we assume the source galaxies to be binned solely by photo-z, 
but generalizations are possible, e.g. including a second population of source galaxies sets that 
are observed spectroscopically. Note that when additional galaxy populations are introduced, we 
need a new set of nuisance functions to describe them. Furthermore, we need to model the cross- 
correlations between all galaxy populations. 

The calculation of the Fisher matrix takes < 1 minute per multipole bin using a single core of 
a typical current-epoch desktop CPU. Total execution time for a forecast is 10-20 minutes with the 
default parameters, with the most time-consuming operation being the marginalization over bias 
model parameters. The execution time is very sensitive to the redshift-shell width A In a and to the 
width of the photo-z error distribution, as these control the number of subsets and the parameter 
count. 

5. Power Spectrum Models 

In most cosmological models, theory will offer strong guidance to the form of the lensing- 
potential power spectrum P^(k,z). In most forecasting or data-reduction codes this is a fully 
deterministic function of a small number of cosmological parameters. In our analysis, however, the 
theoretical prediction is taken as the mean P^ value of a prior distribution of finite uncertainty. 

5.1. Central Model 

The WL likelihood given above can be calculated for any model that predicts P^. We have 
chosen to implement a model that allows for failure of General Relativity in describing growth of 
structure; but other models are possible if one wishes to test the Poisson equation or other tenets 
of General Relativity. Under the conditions 

1. The potential and the mass-energy density are related by the Poisson equation of General 
Relativity, and 
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Table 1. Nuisance variables that can be replaced by functions 



Description 


Discrete variables 


Parametric function 


ijeiisiiig, puLeiiLidi puwei specniim 


r i 


r {K,z) 


Shear calibration error 


fai 


f(z,Az) 


Magnification bias 


Qai 


q(z,Az) 


Redshift distribution 


Pai 


p(z,Az) 


Source density bias 


b 9 - 

"at 


b 9 (k,z,Az) 


Density-mass correlation 


g 
r ai 


r 9 (k,z,Az) 


Intrinsic alignment bias 


b K . 

w ai 


b K (k,z,Az) 


IA-density correlation 


ai 


r K (k, z, Az) 


Density-density x-correlation 


r a% 


rM(k,z,Az a ,Azp) 


Density-IA x-correlation 


gn 
r a/3i 


r3-(k,z,Az a ,Az p ) 


IA-IA x-correlation 


' a(3i 


r KK (k,z,Az a ,Azp) 



Table 2. Controlling inputs to Fisher forecast 



Parameter name 


Description 


Default 


fsky 


/ sky , imaging sky coverage 


0.5 


minLogL 


log 1 o^mim minimum multipole 


1.0 


maxLogL 


log 10 ^max> maximum multipole 


3.5 


logLStep 


A log 10 £, multipole bin width in dex 


0.3 


zmax 


•Zmax, redshift of most distant shell 


3.5 


dlna 


A In a, width of distance shells 


0.03 


coreDLna 


Maximum ln(l + Zi)/(1 + z a )\ of subsets 


0.15 


sigGamma 


a 7 , shape noise per source galaxy 


0.24 


zdist 


String specifying fiducial source n a and p a i 




logNSpec 


logio Aspec 


5. 


outf ile 


Root name for output files 
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2. Non-relativistic matter is the only significant inhomogeneous component of the Universe, i.e. 
there is no dark-energy clustering, and 

3. Matter is conserved, p m oc a~ 3 , and 

4. $ = —VP, as in the absence of anisotropic stress for General Relativity 



then the potential power spectrum is related to the matter-density fluctuation spectrum P rn via 

2 



k A P*{k,a) 



~2a 



P" 



, a . 



(25) 



Under these conditions, the linearized perturbations to the metric grow in a scale- free manner so 
we can write 

Pi(k,a)=gl(a)P* im (k)T\k). (26) 
In our current code, the primordial power spectrum is a power law 



A 2 



A: 3 



prim(A^) — ^ > prim(^) 



3A 



(k/k 



tUs — 1 



(27) 



The curvature variation A^ and spectral index n s are free parameters. A running of the slope could 
easily be added. The normalization wavenumber k$ m ust be set by some con vention. We typically 
adopt the 5-year WMAP parameters as fiducial values iKomatsu et al.l (|2008l ) . 



The transfer function T{k) is taken from lEisenstein &; Hul (|1999l ). It is a function of the matter 
and baryon densities oj m and w&. The impact of massive neutrinos could be added to the transfer 
function if desired. We ignore the baryon acoustic oscillations; experiments that try to exploit them 
will generate a distinct Fisher matrix for them. 



The map from to the nonlinear P^j is derived using the prescription of lSmith et al\ ( 20021 ) 
for nonlinear P m , combined with the Poisson equation (|25p . The Smith et al. formula also requires 
knowledge of O m at the desired epoch, but it can be expressed in ter ms of other quantit ies that are 



already in our model: Q. m {zi) = oj m a i 3 /i 2 {zi). We do not expect the lSmith et all i 20021 ) formula to 



describe non-linear growth to high accuracy for all (or any) cosmologies. It does however capture the 
dependence of non-linear power on cosmological parameters to a level that suffices for forecasting 
purposes. 



In General Relativity, the growth function 
Defining F = ln(a^), the growth equation is 



(a) is determined by the expansion history H(z). 



F" + {F'f + F'(2 + d ^ 
* a in a 



3uj m 
2W ; 



(28) 



where a prime denotes differentiation with respect to In a. Note that F' is the quantity d In g m /d In a 
that appears in the peculiar-velocity power spectrum for a tracer of mass. 
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If GR holds, then the above relations fully specify the model for P^ given {u! m , u>f„ n s , In A^} 
plus the expansion history, which in turn is given by {-Di,cjfc}. As a test of GR, we allow the 
growth function arbitrary deviations from the -Ffid that solves the GR growth equation for the 
fiducial expansion history: 

hiag^ai) = F fid (a;) + 5^. (29) 
The 5Fi become parameters of the likelihood function. 



5.2. Model Errors 

An important WL systematic is the expected finite accuracy in theoretical modeling of the 
power spectrum. We hence introduce an error function to describe the describe the (logarithmic) 
difference between the power P^ and the value predicted by the parametric model described in the 
previous paragraphs: 

lnP*(Jfe,a) = hiP^(k,a) + 6lnP(k,a). (30) 

The nuisance function 5\nP will be described with the u kz n parametric form described in Ap- 
pendix [Al We parameterize the <51nP function by its values 5Pij at a grid of points (k{,aj) 
regularly spaced in In A: and In a. The JlnP is linearly interpolated between grid points. The SPij 
become free parameters of the model and hence parameters in the likelihood function. We then 
place an independent Gaussian prior on each SPij which has mean of zero and a standard deviation 
of 

1 + 51og 10 (fci/fci) ki > kx 



Va r( «y = 0.012/^^ — " — (31) 



kx = a-^Mpc- 1 . (32) 
This function is a fit to an estimate, supplied by Hu Zhan, of th e impact of baryonic physics on 



the mass power spectrum IZhan fe Knox! (|2004l ) : I Jing et al\ (|2006l ). We scale the overall size of the 



theory-error systematic with the control scalar ,/zhan- We can also adjust the density A In A; and 
A In a at which the 5Pij grid points are spaced. This corresponds to setting some coherence length 
for theory errors in this space. In £J7]we will investigate the choice of these grid spacings. 

The procedure above means that we replace the Pf as parameters in our likelihood with a new 
(and hopefully smaller) set: 

• The small set {to m , u>b, A^, n s } that control the linear power spectrum. 

• The {5Fi} which define the growth function vs redshift. 

• A grid of theory error values 5P, which are nuisance parameters to marginalize after con- 
struction of a Fisher matrix or likelihood. We have physically-based priors to apply to these 
before marginalization. 
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Table [3] lists the input fields for the part of our forecasting code which constructs the power- 
spectrum model. The WMAP5 ACDM cosmology provides the fiducial values of all lensing power 
values; the program inputs define the behavior of the deviations from the theoretical model and 
the prior expectations on the size of such deviations. 

6. Non-parametric nuisance modeling 

6.1. General comments on nuisance functions 

The WL likelihood contains many nuisance parameters that are discretized representations 
of nuisance functions. It is common in the literature to assign some parametric form to a nui- 
sance function, then marginalize over the parameters of the nuisance function to recover a purely 
cosmological likelihood. This can be a very dangerous approach: if the nuisance function does 
not in actuality follow the assumed form, then the process is invalid and we may have greatly 
overestimated the power of the experiment to remove the systematic error from the signal. When 
marginalizing over a systematic, we must be sure that the assumed parametric form is sufficiently 
flexible to include any expected manifestation of the systematic. For example we should not assume 
that systematics scale linearly with redshift unless there is a physical reason to expect this. 

It is unfortunately not possible to model a completely free function with a finite number of 
free parameters. This becomes possible, however, if we limit the bandwidth of variation in the 
function. As an example consider our power-spectrum theory error function <51nP(fc, z). We could 
decompose 5lnP into Fourier modes or polynomial terms over its finite (k,z) domain. Retaining 
a finite number of modes or terms leads to a tractable parameterization, albeit with a maximum 
frequency or polynomial order that defines a coherence length for the reconstructed function. For 
5 In P we choose to limit the bandwidth using linear interpolation between a 2d grid of specified 
values. In Appendix |A] we describe the family of functions that we use to model the nuisance 
functions of (k, z, Az) that are common in the WL likelihood analysis (cf. TableH])Q The Appendix 



1 In practice we use In a and A In a to specify each subset's nominal redshift and redshift error, but in the text we 
will stick with (z, Az) to reduce the clutter. 



Table 3. Power spectrum inputs to Fisher forecast 



Parameter name 




Description 


Default 


zhan 


/zhan > 


power-spectrum theory uncertainty relative to baryonic effects 


0.5 


psDlnk 


A In A;. 


node spacing of power-spectrum theory errors in A: 


1.0 


psDlna 


A In a. 


node spacing of power-spectrum theory errors in a 


0.5 
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describes both the functional form, and the prior likelihoods on the parameters that are used to 
give the nuisance function the desired RMS uncertainty. 

These models of nuisance functions are non-parametric in the sense of being able to reproduce 
very general types of behavior once the bandwidth is specified. The question remains: what is the 
proper choice of bandwidth to allow the nuisance function? Our approach is to find the bandwidth 
which causes the most damage to cosmological constraints under a prior that specifies the expected 
RMS fluctuations in the nuisance function. This is the most conservative approach. Typically one 
finds the following: if the nuisance function is given a highly coherent, low-order functional form, 
then it is easily distinguished from cosmological signals and can be marginalized away with little 
damage to cosmological constraints. On the other hand if the nuisance-function bandwidth is very 
high, then the broad WL kernel tends to average away the nuisance signal, leaving little trace in the 
cosmology. There is an intermediate point where the systematic error is most easily confused with 
cosmology. The conservative approach is to find this regime and use it for modeling the systematic 
error. In SJ7] we will find coherence lengths in z and k at which our systematics are most damaging. 

6.2. Redshift Distributions 

We specify the fiducial values of the photo-z distribution n a and error probabilities p a i either 
with analytic formulae (e.g. photo-z errors Gaussian in In a), or by taking the output of a simulation 
of galaxy detection and photo-z assignment for the chosen survey. For spectroscopic samples or the 
CMB source plane there are no photo-z errors at all. 

We do not place any parametric form or prior assumption on the p a j. All redshift constraints 
arise either from the lensing survey data itself or from additional spectroscopic data. The likelihood 
arising from spectroscopic redshift samples is described in £j3l 

6.3. Shear calibration and magnification bias 

The shear calibration factors f a i and magnification bias coefficients q a i are, most generally, 
distinct in every subset. We use the ll zAz" functional form described in Appendix lAl to generate 
the f a i and q a i from a smaller set of function parameters. Each function is specified by a polynomial 
function of Az; polynomial coefficients are interpolated between grid points equally spaced in In a 
at intervals A In a. 

We set the fiducial functions to be f a i = 0, and q a i = q^d independent of (k, z). The priors on 
the polynomial coefficients are chosen to yield a chosen RMS variation of / or of q. As detailed in 
Appendix |Al we also specify whether the nuisance function varies mostly along the z direction or 
along the Az direction of its domain. 

Table H] lists the program inputs necessary to specify the model for / or q: their functional 
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form, fiducial values, and priors on deviations from the fiducial. 

6.4. Galaxy correlation coefficients 

The correlation coefficient r 9 ai is, most generally, different at each subset and at each multipole 
t. We model r 9 using the ll kzAz v function form described in Appendix El These functions are 
polynomial in Az, with the polynomial coefficients linearly interpolated from a grid in (In /c, In a) 
space. This grid of polynomial coefficients replaces the r 9 ai as parameters in the likelihood. 

For the fiducial correlation coefficient, we interpolate smoothly between a linear and non-linear 
limit according to the value of Aj 2 in (/c, z) = k 3 P^(k, z)/2tt 2 : 

^(jfe, z) = r NL+ r L + V NL r L) tan - 

The constant W sets the width of the transition from the linear to nonlinear regime. We set W = 1 
unless otherwise noted. 

Each polynomial coefficient at each grid point is assigned an independent Gaussian prior. 
These are selected to yield a preselected RMS variation r^ Mg . The RMS prior uncertainties are 
interpolated in (k, z) space between linear and non-linear limiting values rf^ MS L and ^^ MS NL using 
the same functional form (|33l) . 

Table [5] lists the program inputs needed to specify the galaxy bias and correlation models. 

6.5. Galaxy bias 

We expect W to vary quite strongly with z, as the more distant source galaxies are likely 
intrinsically very bright and highly biased. There may also be a strong dependence of b 9 on Az 



Table 4. Shear calibration and magnification bias inputs to Fisher forecast 



Parameter name 


Description 


Default 


f RMS 


/rmSj RMS variation of / allowed under prior 


0.01 


qFid 


<7fid, fiducial value for all q a i 


1.0 


qRMS 


<7RMS> RMS variation of q allowed under prior 


0.1 


fqDlna 


A In a, node spacing for / & q models 


0.5 


f qDzOrder 


Order of polynomial used to model Az dependence of /, q 


2 


f qVarFracDZ 


Fraction of /, q variance that due to Az dependence 


0.5 



In Ai in (fe,z) 
W 



(33) 
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because both Az and the bias may couple strongly to galaxy spectral type. Variation with k should 
be weaker. We hence define b 9 to be the sum of two functions, 

b s = b 9 oaTBe (z,Az) + b^ ne (k,z,Az). (34) 

The fiducial values are hoarse = b^ d , 6| ne = 0. 

The coarse contribution is given a very weak prior, but can only vary slowly with z: A In a = 0.5 
by default for the hoarse S r id nodes. 

The 6g ne function is interpolated between the same (Ink, In a) grid points as the correlation 
coefficient r 9 . The prior on each 6| ne node is arranged to give RMS uncertainty that is interpolated 
between linear and nonlinear limits &r MS l and &p> MS NL just as for r 9 . 



6.6. Intrinsic Alignments 



The strength of intrinsic alignments are specified by the 6^ and values at each multipole 
t. As for the galaxy density, the free-parameter count can be reduced by specifying parametric 
functions b K ,r K of (k,z, Az) instead. Each of these two functions modeled using the "fczAz" form 
described in Appendix |Aj just as for r 9 . The fiducial 6g d and rg d are taken as constant over the 
entire domain. The RMS variation b^ MS and r^ MS in the priors of these functions are also taken to 
be constant over the domain. We take b^ MS = |6g d |, because we expect that the best constraints on 
IA to always arise from self-calibration of WL s urveys rather than through any external modeling or 
prior. Roughly speaking, the IA measured by iMandelbaum et al.l (|2006l ) for the SDSS population 
corresponds to b K ~ —0.003 ([Bridle &: Kind 120071 ). which we will normally take as our fiducial 
model for IA. Setting 6g d = turns off the IA systematic entirely. 

Table [U lists the program inputs needed to specify the functional form for intrinsic alignments, 
the fiducial values, and the prior constraints. Note that we assume the intrinsic-alignment functions 
to be defined on the same (In k, In a) grid as the galaxy bias and covariance functions. 



6.7. Cross-correlation coefficients 



The cross-correlation coefficients > r a fii > an< ^ r api even more complex because each 
depends on k,z, plus two subsets' Az a i and Azpi. We find it infeasible to construct nuisance- 
function templates spanning 4 dimensions. We therefore simplify by first writing 



r 9« 

a/3i 



ai' Pi 



+ s 



af3i 



[1 



r.9 \21 



1 



' I3i> 



(35) 



A value < 1 is necessary (but not sufficient) to keep the mass-galaxy covariance matrix 

from acquiring non-physical negative eigenvalues. In principle the functional form of s 9K must vary 
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Table 5. Galaxy bias inputs to Fisher forecast 



Parameter name 


Description 


Default 


b g 


6| d , fiducial galaxy bias 


1.5 


rgL 


r£, fiducial galaxy correlation coeff at linear limit 


0.9 


rgNL 


r^ L , fiducial galaxy correlation coeff at non- linear limit 


0.6 


biasDlnk 


A In k interpolation grid step for bias models 


1.0 


biasDlna 


A In a interpolation grid step for bias models 


0.1 


brgRMSL 


^RMS L an d r RMS L RMS prior variation for bias and correlation, linear limit 


0.05 


brgRMSNL 


^RMS NL an d r RMS NL RMS prior variation non-linear limit 


0.10 


brVarFracDZ 


Fraction of b 9 ,r 9 variance due to Az dependence 


0.2 


bgCoarseDlna 


A In a node grid spacing for hoarse 


0.5 


bgCoarseZRMS 


RMS prior variation of 6? arse at Az = 


0.5 


bgCoarseDZRMS 


RMS prior variation of hoarse at fixed z 


0.3 


kgg 


K^, fiducial value of K"(k,z) cross-corr spec 


2. 


kggRMS 


-^RMS' P r i° r variation on K g9 (k,z) 


1. 



Table 6. Intrinsic-alignment (IA) inputs to Fisher forecast 



Parameter name Description Default 



bk 


6g d , fiducial intrinsic alignment 


-0.003 


rk 


rg d , fiducial correlation between mass and intrinsic alignment 


0.7 


rkRMS 


r RMS' RMS prior variation in r K 


0.2 


kkk 


Kll, fiducial value of K KK (k, z) 


1. 


kkkRMS 


-K"rms> RMS prior variance of K KK 


1. 


skg 


s| d , fiducial IA-bias cross-correlation 


0. 


skgRMS 


S RMS' RMS prior on IA-bias cross-correlation 


0.3 
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over four dimensions, but we make the gross simplification that it is constant for the survey, since 
we expect this type of cross-correlation to have minimal effect on cosmological constraints. The 
program thus requires simply a fiducial scalar s 9K and an RMS for its Gaussian prior. 

The density-density cross-correlation r" may have substantial impact on cosmological con- 
straints, so we model it with more freedom, though not full 4-dimensional behavior. We set 



99 

S a% = maX 



0,1 -K^(k,z^ Za M 



2Az 



max 



(36) 



This functional form for s gives the most closely related subsets the highest covariance. Az max is 
the width of the redshift distribution within a set. The function K g9 (k, z) adjusts how quickly the 
subsets decorrelate as their photo-z's diverge. K"(k, z) is implemented as the il kz" functional form 
described in Appendix [A] namely a linear interpolation between a grid of values in the (lnfc, In a) 
plane. We hence parameterize the 4-dimensional cross-correlation function by a 2d grid of K" 
nodal values. These points are all given the same fiducial value K^. The K" nodal values are 
given Gaussian priors to select a range of uncertainty Kj^ MS . 

The cross-correlation parameters r*^ are similarly reduced from 4-dimensional behavior by 
defining values s^L that are set by the nodal values of a 2-dimensional function K KK (k,z) in 
complete analogy with Equation ([36]) . 



6.8. An apology 



This section on nuisance functions is obscure and lengthy, especially regarding the cross- 
correlations of galaxies and intrinsic alignments. It is, unfortunately, impossible to fully describe 
the likelihood of lensing survey data without invoking some model for all of these functions. 

Previous work has avoided these messy details and functions by making many simplifications. 
Most have implicitly assumed that all the correlation coefficients are unity. Most have ignored 
the intrinsic-alignment signal entirely, i.e. taking b K = 0. All previous analyses have considered 
the galaxy bias to be constant within a set, and if the multiplicative error has been considered, 
it has also been co nstant within a galaxy set. Only a few analyses have allowed galaxy bias to 
vary with redshift flZ han 2006; B ernstein &: Jainll2004l ). The most sophisticated treatment to date 
is that of iHu fc Jainl (J2004J), who take all bias and correlation coefficients to derive from a halo 
model of galaxies. The redshift-distribution parameters p a i have , in the most ambitiou s analyses 
to date, been reduced to two-parameter (Gaussian) functions. iMa &: Bernstein! (|2008l ) consider 
sum-of-Gaussian models. These assumptions can all be implemented in the present formalism if 
desired, but can also be relaxed to assess their impact on the cosmological constraints. 
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7. Tuning the forecast parameters 

In this section we determine the values of bin widths and nuisance-function bandwidths that 
are needed for reliable extraction of maximum information from lensing surveys. Unless otherwise 
noted, we will derive these parameters for a canonical survey with f s k y = 0.5; an effective source 
density of 60 galaxies per arcmin 2 with median redshift of 1.2; cx 7 = 0.24; and Gaussian-distributed 
fiducial photo-z errors of a z = 0.04(1 + z). Except as noted, we assume N spec = 10 7 so that photo-z 
calibration errors are negligible, and also reduce the shear calibration RMS uncertainty to 10 -4 to 
be negligible as well. Other inputs assume the default values given in Tables [2H6] 



The information content of a survey will be gauged using the DETF figure of merit (jAlbrecht et al 



2006): the Fisher matrix will be marginalized over all nuisance parameters, then the D{ and 6Fi 
variables projected onto a model obeying General Relativity with homogeneous dark energy of 
equation of state w = wq + w a (l — a). A prior representing expected Planck results is added 
(also from the DETF report), and we marginalize over {u m , uj,, uik, wdEj n> s , A^} to yield the Fisher 
matrix F w over {w ,w a }- The DETF FoM is defined as (F^l 1 / 2 . 



7.1. Multipole bin size 

Since we have excluded baryon oscillations from our transfer function, we expect to find little 
information in the detailed shape of the lensing or density power spectra. The broad lensing kernel in 
redshift also smoothes away fine structure in the convergence. So we expect the information content 
in the Fisher matrix to be independent of the multipole bin width A log 10 1 below some modest 
value. Larger values of Alog 10 ^ reduce the complexity and execution time of the calculations, so 
we seek the maximum A log 10 ^ a ^ which nearly all the lensing information is present. 

Figure Q] plots the DETF FoM of the lensing+density survey (plus spectroscopic redshift survey 
and Planck prior) vs Alog 10 ^ for several candidate surveys. The top line is for a very optimistic 
survey: n eS = 100 arcmin" 2 , N spcc = 10 7 , q RM s = 10™ 3 , /rms = 10" 4 , 6^ MS = r| Mg = 0.01, and 
6g d = 10~ 3 . By reducing the systematics and the shot noise to (unrealistically) low levels, we give 
the lensing survey the chance to extract maximal information. We find that the FoM gains only 
2% for Alog 10 £ < 0.3. 

Other lines in the plot are FoM vs Alog 10 ^ for weaker surveys, with N spec = 10 4 - 5 and/or 
n e fj = 60 arcmin -2 , ^rms = 0.1, fr RMS = ^rms = 0-l> = — 0.003. In these cases we also find that 
the DETF FoM increases by < 2-3% for Alog 10 ^ < 0.3. 

We adopt A log 10 1 = 0.3 for all future use. 
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Fig. 1. — Left: DETF Figure of Merit resulting from Fisher analyses of several weak lensing surveys, 
plotted against multipole bin width Alog 10 £ of the analysis. Each solid line plots a particular 
survey scenario (see text for details). The dashed lines are horizontal, to help the eye judge the 
information degradation as we increase Alog 10 £ We conclude that Alog 10 ^ < 0.3 retains > 97% 
of the information for surveys of any quality level. Right: Value of DETF FoM for the default 
survey vs range of multipole used. Choice of upper bound is more critical than choice of lower 
bound on I. 
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7.2. Multipole range 

On the right-hand side of Figure Q] we plot the DETF FoM for various ranges of £. In this 
study we assume a space based survey obtaining n c g = 60arcmin -2 over / s k y = 0.5. The photo-z 
and shear calibration systematics are held negligible with priors but other systematics (intrinsic 
alignment, etc.) have default priors. 

We see that the choice of £ max has a strong influence, as moving from 10 3 to 10 4 changes the 
FoM by 1.7x. We note this is true even though we have included uncertainty in the theoretical 
power spectrum at high k values, showing that there is still information to be gained when the 
theory is incomplete. We find, in fact, that our default power-spectrum theory uncertainty of 
/zhan = 0.5 leads to only 6% degradation of the DETF FoM relative to an assumption of zero 
uncertainty in the theory, even when l max — 10 4 . 



Unfortunatel y our assumption of Gaussian statistics will fail by £ = 10 ICooray Hul (|2001 



Lee Sz Pen! (|2008l ). rendering the Fisher calculation less reliable. We will restrict our analysis to 



£ < 10 35 , but additional study of the effect of non-Gaussian statistics is clearly needed. 

The flat-sky and Limber approximations will fail at low I, but the choice of £ m \ n appears less 
critical to the wo/w a information content, so will retain the £ > 10 bound in our analyses. 



7.3. Scale resolution for nuisance functions 

We require choice of node spacing A In A; in the nuisance functions for the power-spectrum the- 
ory errors, the calibration errors / and q, and the bias/correlation parameters b^ ne ,r 9 ,b K ,r K , K" , 
and K KK . We set A In A; to be equal for all nuisance functions, and find the value which minimizes 
the DETF FoM as the "most damaging" scale of variation. We examine the default case described 
above, for several values of the shear calibration prior /rms an d photo-z calibration size N spec . 

Figure [2] shows Alnfc ~ 0.7-1 yields minimum information for fixed RMS priors, but the 
dependence is very weak. The FoM varies by only 7% over the range 0.5 < A In k < 1.5. We 
henceforth adopt A In k = 1. Perhaps not surprisingly, this makes the nuisance functions have ~ 1 
independent node in each multipole bin of Alog 10 ^ = 0.3. 



7.4. Redshift resolution for nuisance functions 

All of the nuisance functions are dependent on z. We next investigate the redshift node spacing 
A In a at which the nuisance functions are most damaging to the DETF FoM. We find that the 
FoM is insensitive to the A In a of the power-spectrum theory errors. The value of A In a for the 
calibration functions / and q that minimizes the FoM depends upon the strength of the prior. The 
choice A In a = 0.5 produces a FoM that is within 2% of the minimum, however, so we fix this value 
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0.4 0.6 0.8 1.0 1.2 1.4 1.6 0.1 0.2 0.3 0.4 

A In k A In a 

Fig. 2. — Left: DETF FoM vs A In k, the spacing of nuisance-function nodes in the length-scale axis. 
We examine surveys with varying strengths of priors on shear calibration and photo-z calibration. 
For this plot, each is normalized to the value at A In A; = 0.7 in order to show the (weak) dependence 
of FoM on A In A; when the prior on RMS nuisance- function fluctuations are held fixed. We adopt 
A In k = 1 as the most conservative bandwidth for nuisance-function variation with scale. Right: 
DETF FoM vs A In a, the spacing of nodes in redshift for the bias, correlation, and intrinsic- 
alignment nuisance functions. The FoM is quite sensitive to this choice, and allowing the bias/IA 
to vary on A In a = 0.1 scales is most damaging to cosmological inference. 
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for the theory-error and calibration nuisance functions. 

The redshift freedom given to the bias and intrinsic-alignment nuisance functions has a strong 
impact on the DETF information content. Figure [2] illustrates that, at fixed RMS prior variation, 
models with freedom to vary on rather fine scales, A In a = 0.1 are most damaging to cosmological 
information. 



8. Conclusion 

The core of this paper are the expressions f)18j) — (|20|) for the two-point correlation matrix of the 
lensing and density observable multipoles produced by a typical lensing survey. This was derived 
under a very limited set of assumptions: a homogeneous and isotropic 4-dimensional metric Universe 
with scalar perturbations; plus the weak-lensing limit, the Limber and Born approximations, and an 
approximation that lensing magnification bias and intrinsic density fluctuations are additive. The 
last four assumptions could be relaxed at the expense of computational complexity. We thus hope 
that data analyses based on this framework could be used to constrain a wide variety of potential 
explanations for the acceleration phenomenon, including gravity modifications as well as new fields 
in the Universe. In the limit of Gaussian fluctuation fields, the two-point information is a complete 
description of the likelihood and hence can be used to construct Fisher matrices or analyze data. As 
currently configured, the analysis yields the survey's ability to constrain the distance function D(z) 
and linear growth function g<j,(z), without reference to particular dark-energy models. It would be 
straightforward to implement scale-dependent linear-growth functions. 

This framework subsumes all of the information (up to 2-point level) that is likely to be 
obtained from lensing observations: density-density, lensing-density, and lensing-lensing correla- 
tions, plus redshift distributions from unbiased spectroscopic surveys ($3|). Furthermore it allows 
for the most important expected forms of systematic error: photo-z calibration errors, shear and 
magnification-bias calibration errors, intrinsic alignments, and inaccuracies in power-spectrum the- 
ory. Systematics that are additive to shear (e.g. uncorrected PSF ellipticity) or to density (e.g. 
uncorrected foreground extinction) have not been included. We have not done so since the additive 
errors could, in principle, exhibit almost any arbitrary signature in the covariance matrix of the 
observables. Hence a completely general model for additive errors would be degenerate with almost 
all other signals. For the additive systematics, it is better to det ermine the level at which they 



would bias the cosmological results than to attempt to fit a model. lAmara Refregieri (|2007l ) is a 
good example of this approach. 

Since the analysis framework is independent of models for dark energy, gravity, power-spectrum 
evolution, or galaxy bias, we get a stripped-down look at what parameters are truly constrained 
by the data, and what nuisance functions must be modeled in order to extract the cosmological 
information. There is a substantial suite of biases and correlation functions involved in under- 
standing the full survey data. In other work these have been ignored, or have been quantified by 



reference to halo occupation models (|Hu Jainl 12004 ; ICacciato et aZ.1 120081 ) . Here we introduce 
generic functions for bias and calibration nuisance functions that are not based on any particular 
physical model. 



We implement one possible model for the evolution of the lensing-potential power spectrum, 
based on General Relativity but allowing for failure of the growth equation. It is straightforward 
to implement other potential deviations from General Relativity. In the current implementation, 
the end result of the Fisher analysis is a forecast of the ability to constrain the functions Da(z) 
and g<j,{z). 

Since the analysis must be discretized in redshift and angular scale in order to be feasible, 
we investigated the bin sizes or bandwidths of nuisance functions that should be chosen. We find 
that ~ 3 bins per decade of angular scale suffice to extract all information (apart from baryon 
acoustic oscillations), and that nuisance functions should be specified no finer than this. Nuisance 
functions for power-spectrum theory errors and for shear and magnification-bias calibration errors 
can be specified coarsely in redshift space (A In a ks 1), but the galaxy biases, correlations, and 
intrinsic-alignments must be modeled with potentially finer structure in redshift (A In a ~ 0.1) to 
immunize against potential astrophysical systematics. 

In future papers we will use this framework and its implementation to investigate the require- 
ments for spectroscopic calibration of photo-z's in large lensing surveys, and other practical issues. 
As a simple first application of our framework, we have shown here that power-spectrum theory 
uncertainty does not significantly degrade the cosmological power of a nominal lensing survey at 
10 < i < 10 4 . Non-Gaussian statistics are a much more important factor to consider. 

C++ Code to implement Fisher forecasting using this framework has been written and runs 
quickly on desktop computers despite the large number of free parameters in these general models. 
Interested parties should contact the author for access to the code. 

This work is supported in this work by grant AST-0607667 from the National Science Foun- 
dation, Department of Energy grant DOE-DE-FG02-95ER40893, and NASA grant BEFS-04-0014- 
0018. I thank Bhuvnesh Jain, Zhaoming Ma, Chris Hirata, Ravi Sheth, Hu Zhan, and the members 
of the Dark Energy Task Force for helpful conversations during the long gestation of this work. 

A. Parametric functional forms for nuisance variables 

In modeling an experiment, we often encounter some systematic error associated with a nui- 
sance variable / about which we have little a priori knowledge. We would like to fit some parametric 
form to this variable, but would like a form that is flexible enough to describe any "reasonable" 
behavior the function might exhibit. We also want to conveniently relate the number and prior 
probabilities for the parameters to the kind of variation that / might exhibit. A parametric de- 
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scription of some nuisance function / denned over a variable x € [—1,1] would ideally have the 
following properties: 

1. /(x) has a variable number N of controlling parameters {do, 02, • • • , o/V-i} such that any- 
continuous differentiate function F(x) can be approximated to any desired accuracy with a 
sufficiently large choice of N. 

2. We can draw {aj} from independent Gaussian distributions of zero mean and widths {<Jj}, 
with the result that Var[/(x)] is independent of x. In other words, the nuisance value / has 
a uniform and well-determined variance when we apply a simple diagonal Gaussian prior to 
the parameter set {aj}. 

A Fourier decomposition, / = YK a j s * I1 3' KX + bj cos jnx) , exhibits these qualities, but converges 
poorly when /(— 1) 7^ /(+!). 



A.l. Linearly interpolated functions 

Another approach is linear interpolation: choosing a spacing Ax = 2/(N — 1), we define aj as 
the value of / at Xj = iAx — 1. At some other Xj < x < Xj+i, we define 

f(x)=wai + (l-w)a i+1 , W = ~ J ~^~' ( A1 ) 

If we assign an independent Gaussian prior of width a a to each aj, then by definition we have 
Var[/(x)] = a\ if x coincides with a node. But the variance of / is not quite homogeneous: it drops 
to Var[/(x)] = cr^/2 when x is halfway between two nodes. If we want the mean variance of /(x) 
over the interval x G [-1,1] to equal aj, then the variance of the prior on each node needs to be 
°l = 3^/2. 



A. 2. Legendre polynomials 

Polynomial expansions are also commonly used to model nuisance functions. The simplistic 
form /(x) = ciiX 1 results in extremely non-uniform variance for / with diagonal prior on {<2j} 
and is hence inappropriate for our purpose. A better choice is to expand in Legendre polynomials 
P n (x), which are orthogonal over [—1,1]. We define 

N-l 

f{x) = Y, aiPi(vx). (A2) 

i=0 

Recall that the Legendre polynomials satisfy Pq = 1, P\ = x, (n + l)P n +i = (2n + l)xP n — nP n -\. 
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We wish to choose priors {crj} on {at} that cause the variance of f(x) be as uniform as 
possible for x € [—1, 1]. We have not found a way to attain perfect uniformity in x with polynomial 
interpolation, however the following scheme gets usefully close. We discover numerically that 

-i N—l 

j im 7J e ^ w(« + i )=U i - ■ (A3) 

iV— >oo iV ^ — ' 7T 

i=0 



This implies that, if we set Oi = \J (2i + l)ir/2N, then in the limit of large N we will obtain 

— 1 /2 

Var[/(x)] = (l — v 2 x 2 } . For v = 1 this would diverge at the ends of our nuisance function's 
interval. If, however, we choose v = 0.9, the RMS is only « 1.5 x larger at the endpoints than 
at x = 0. Over the [—1, 1] interval, the mean variance is (sin -1 v)/v. Hence if we wish to have a 
function with (V&rf(x)) x = aj-, we set the priors on the Legendre coefficients to be 



,J {2i+1) T. (A4) 



A. 3. Standard multidimensional functions 

The Fourier, linear-interpolation, and Legendre-polynomial functional forms can each be ex- 
tended to > 1 dimensions in straightforward fashion. In the lens modeling, we need functions of 
these dimensions: 

1. Comoving wavenumber or physical scale: x\ = In A;; 

2. Redshift z: more precisely we will use the variable X2 = m(l + z); 

3. Photometric redshift error Az; more precisely, our code uses the variable X3 = A ln(l + z) = 
ln[(l + z a )/(l + Zi)\ for subset ai. 



A. 3.1. kz form 

For functions over the (x\,X2) space we use a simple two-dimensional version of interpolation 
between values on a rectangular grid. The power-spectrum theory error SlnP uses this form, as 
do the K"(k, z) and K KK (k, z) nuisance functions. The only complication of note is that the nodal 
point dij should have a prior with variance a 2 a = (3/2) 2 <t| if the output function is to have variance 
Oj. The factor of (3/2) 2 is needed to counteract the reduced variance when interpolating between 
grid points. 

The kz nuisance function is specified by: 
• the spacing Axi = A In k of the nodes for linearly interpolation in X\\ 
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• the spacing Ax2 = Aln(l + z) of the nodes for linearly interpolation in X2] 

• the RMS variation cr/ of the function allowed under the prior, which can depend on x\ and 
x 2 . 

• the fiducial dependence of / on x\ and x<i- 

A. 3. 2. zAz form 

For nuisance variables over the (x 2 ,x 3 ) space (z and Az), we adopt the following strategy: we 
choose to have variation over x 3 be described by polynomials since we usually define the range of 
non-catastrophic photo-z errors to be bounded to some range |a?3 1 < A max . We define the "zAz" 
functional form as follows: 

JV-l 

f{x 2 ,x 3 ) = ^2 a i (x 2 )P i (ux 3 /A max ). (A5) 

i=0 

The Legendre coefficients are in turn defined to be linearly interpolated between a series of values 
dij at redshift nodes {ln(l + Zj)}. The a^- become parameters of the model. 

The fiducial and prior values for the i = terms (constant in x 3 ) are treated differently than 
the i > terms. We might, for example, expect some nuisance functions to vary strongly with X2 
(nominal redshift) but only slightly with x% (photo-z error) at fixed X2- 

We specify the total RMS fluctuation in / allowed by the prior to be Of. But we also specify 
the fraction VarFracDZ of the variance that is due to dependence on x 3 . If we define 

/+A max 
dx 3 f(x 2 ,x 3 )/2A max , (A6) 

then we aim to achieve 

Var[/(x 2 )] = <j) (1 - VarFracDZ) (A7) 
Var [f(x 2 ,x 3 ) - f(x 2 )] = u) (VarFracDZ) . (A8) 

This is achieved approximately by setting the priors on the constant terms as 

3 

= -aj (1 - VarFracDZ) (A9) 

and the X3-dependent terms as 
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The RMS prior variation cry can be made a function of x 2 without loss of generality. We 
typically take fiducial values of = for i > in our nuisance functions, i.e. no fiducial dependence 
upon Az. 

To summarize, the zAz nuisance function is specified by: 

• the order TV" of the polynomial in £3; 

• the maximum range A max of applicability in the X3 axis; 

• the spacing Ax2 = Aln(l + z) of the nodes for linearly interpolation in X2', 

• the RMS variation <jj of the function allowed under the prior, which can depend on x 2 ; 

• the fraction VarFracDZ of this variance that is due to X3 (Az) dependence; 

• the fiducial dependence of / on 12. 

A. 3. 3. kzAz form 

The bias and correlation coefficients can, most generally, depend on scale (xi) as well as subset 
(x2,X3), so we generalize to the "fezAz" functional form: 

f(x 1 ,x 2 ,x 3 ) = y^ai(xi,x 2 )Pi(x 3 ). (All) 

The coefficients at are linearly interpolated between nodes in the two-dimensional space (xi,x 2 ). 
Thus the free parameters of this model become the Legendre coefficients a^. As for the zAz func- 
tion, we specify the prior by the overall mean RMS variation a y (which can be a function of x 2 and 
X3), plus the VarFracDZ specifying how much of the variance is manifested as dependence on Az. 
The formulae for the priors on the nodal coefficients are derived exactly as in Equations (|A9p 
and (|A10p . except that we now need factors of (3/2) 2 to account for the reduced variance when 
interpolating in two dimensions. 

The kzAz function thus requires all of the specifications as the zAz function, plus a spacing 
A In A; for nodes in the x\ axis. 
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